Text Sentiment Classification Based on Mixed Cloud Vector Model Clustering and Kernel Fisher Discriminant

نویسندگان

  • Yujuan Xing
  • Ping Tan
چکیده

In today’s world, the web has dramatically changed the way that people express their opinions. People use the internet to express their opinion, attitude, feeling and emotion about films, goods, news etc. It is challenging to automatically classify mass subjectivity comments into different sentiment orientation categories (e.g. positive/negative). Furthermore, the ambiguity and randomness, which are existed in natural language, lead to lower classification accuracy in text sentiment classification. In this paper, we propose a novel chinese text sentiment classification algorithm based on mixed cloud vector model clustering and kernel fisher discriminant. In this algorithm, we firstly analysis the role of cloud model theory in conversion between qualitative concept and quantitative values, and explore a mixed feature cloud model (MFCM) based on cloud model to represent a single document. In MFCM, both effect of different part-ofspeech features and ambiguity of sentiment tendency are considered. And then, documents are clustered according to their similarity between MFCM. Finally, kernel fisher discriminant (KFD) is adopted as the classifier to judge views. The experimental results demonstrate that our proposed method outperforms traditional approaches. Keyword: Text sentiment classification; cloud model; kernel fisher discriminant; support vector machine

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Joint Semantic Vector Representation Model for Text Clustering and Classification

Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researches to use...

متن کامل

Kernels for Longitudinal Data with Variable Sequence Length and Sampling Intervals

We develop several kernel methods for classification of longitudinal data and apply them to detect cognitive decline in the elderly. We first develop mixed-effects models, a type of hierarchical empirical Bayes generative models, for the time series. After demonstrating their utility in likelihood ratio classifiers (and the improvement over standard regression models for such classifiers), we d...

متن کامل

On Model-Based Clustering, Classification, and Discriminant Analysis

The use of mixture models for clustering and classification has burgeoned into an important subfield of multivariate analysis. These approaches have been around for a half-century or so, with significant activity in the area over the past decade. The primary focus of this paper is to review work in model-based clustering, classification, and discriminant analysis, with particular attenti...

متن کامل

A Kullback-Leibler Divergence Based Kernel for SVM Classification in Multimedia Applications

Over the last years significant efforts have been made to develop kernels that can be applied to sequence data such as DNA, text, speech, video and images. The Fisher Kernel and similar variants have been suggested as good ways to combine an underlying generative model in the feature space and discriminant classifiers such as SVM’s. In this paper we suggest an alternative procedure to the Fishe...

متن کامل

A High-Performance Model based on Ensembles for Twitter Sentiment Classification

Background and Objectives: Twitter Sentiment Classification is one of the most popular fields in information retrieval and text mining. Millions of people of the world intensity use social networks like Twitter. It supports users to publish tweets to tell what they are thinking about topics. There are numerous web sites built on the Internet presenting Twitter. The user can enter a sentiment ta...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015